Three-fold utilization of supplementary information for mean estimation under median ranked set sampling scheme

Ranked set sampling (RSS) has created a broad interest among researchers and it is still a unique research topic. It has at long last begun to find its way into practical applications beyond its initial horticultural based birth in the fundamental paper by McIntyre in the nineteenth century. One of the extensions of RSS is median ranked set sampling (MRSS). MRSS is a sampling procedure normally utilized when measuring the variable of interest is troublesome or expensive, whereas it might be easy to rank the units using an inexpensive sorting criterion. Several researchers introduced ratio, regression, exponential, and difference type estimators for mean estimation under the MRSS design. In this paper, we propose three new mean estimators under the MRSS scheme. Our idea is based on three-fold utilization of supplementary information. Specifically, we utilize the ranks and second raw moments of the supplementary information and the original values of the supplementary variable. The appropriateness of the proposed group of estimators is demonstrated in light of both real and artificial data sets based on the Monte-Carlo simulation. Additionally, the performance comparison is also conducted regarding the reviewed families of estimators. The results are empowered and the predominant execution of the proposed group of estimators is seen throughout the paper.


Introduction
The ranked set sampling (RSS) design was primarily developed by [1] as an effective inspection plan for estimating the population parameters, i.e., the mean of field and scrounge yields. This inspection plan is appropriate in circumstances where the ranking of perceptions is effectively dependent on a supplementary variable, which is linearly related to the subject variable or any reasonable technique. The RSS has wide applications in numerous logical issues, particularly in natural and environmental examinations where the principal concentration is on practical and productive sampling/inspection techniques. For instance, assume that the agency of environmental protection needs to guarantee that the fuel stations in certain metropolitan territories are distributing gas according to air clean guidelines. It is quite possible the compound parameters of fuel can be effectively ranked right after the assortment at the fuel siphon by some rough field strategies being modest and simple. While carrying the sample units to the research center and utilizing certain lab procedures to gauge its compound parameters is costly. For more details about RSS and its applications, see, for example, [2,3] who suggested the median ranked set sampling (MRSS) for estimating the population mean. The MRSS consists of the following steps: Step 1: Select n random samples, each of size n units from the population and rank the units within each sample with respect to the variable of interest.
Step 2: If the sample size n is odd, then from each sample, select for measurement of the nþ1 2 À � th smallest rank (the median of the sample). If the sample size n is even, then select for measurement of the n 2 À � th smallest rank from the first n 2 samples, and the nþ2 2 À � th smallest rank from the second n 2 samples. The cycle can be repeated m times if needed to obtain a sample of size nm. [4,5] suggested multistage median ranked set sampling for estimating the population mean and median, respectively. [6] considered the modified ratio estimator for the population mean using double MRSS. For more details of RSS and its applications, see [7][8][9][10][11][12][13][14][15][16][17]. [12] considered the median ranked set sampling scheme and built ratio-type estimators based on known quartiles of supplementary information. [18] extended the work of [12] and suggested new families of estimators. These families also contain known mean and quartiles of supplementary information. In this paper, some new estimators for the population mean of the random variable Y are proposed under MRSS scheme. The basic idea is when there exists an adequate relationship (sufficient correlation) between (Y, X), the ranks and second raw moment of the supplementary variable are also correlated with the variate of interest. In light of this reality, we will utilize three forms, i.e., (the original values, ranks and the second raw moment) of a single supplementary variable X. The proposed three-fold utilization of the supplementary variable will provide enhancement in the estimation of population mean μ y . So, we build some estimators for estimating μ y which not only includes the original values of the auxiliary variable X but also on the second raw moment and ranks of the supplementary variable. The appropriateness of the recommendation is additionally shown empirically by utilizing real and artificial populations. The execution assessment uncovers the prevalent presentation of the proposed family in contrast with [12,18] group of estimators.
The rest of the paper is composed of pursues. In Section 1.1, we present documentation (notations) and a concise review of the estimators utilized in MRSS when a supplementary variable is accessible. For every estimator, the theoretical properties of order n −1 are derived. In Section 2, we extend the idea of [12,18], in light of three-fold utilization of supplementary information of a single auxiliary variable, and propose three new estimators for estimating μ y under MRSS with their theoretical properties. In Section 3.1, we perform a reenactment (simulation) study to look at, under changed situations, the productivity of the proposed estimator with that of other focused estimators present in the literature. In Section 3.2, the productivity of the estimator is researched by utilizing a real-life data set. Finally, the conclusion is provided in Section 4.

Preliminaries with respect to median ranked set sampling
Suppose a median ranked set sample of size n is drawn from a finite population ω, consisting of N units. Let Y and X are study and auxiliary variables, respectively. Further, let us assume that Z and V representing; the ranks and squared of each value of the supplementary variable, respectively.
For odd sample size, let X 1 1þn When there is a significant degree of linear relationship exists between the subject variable Y and the supplementary variable X, and the mean and quartiles of X are known in advance, [12] developed two mean ratio type estimators under MRSS design: where μ x is the population mean of the random variable X and (q 1 , q 3 ) represent the upper and lower quartiles of the supplementary variable X. These estimators can be refortified as where j = (E, O) represents the odd and even sample selection. Further, (k = 1, 3) denotes the first and third quartiles, respectively. The Bias and mean square error (MSE) expressions for order n −1 of � y qkðjÞ are

Biasð� y qkðjÞ Þ ffi
if n is odd MSEð� y qkðjÞ Þ ffi if n is odd yl ðOÞ m y y 2 1 s 2 where y ¼ am x am x þb ; for (a = 1, b = q k ) and y 1 ¼ m y ðm x þq k Þ . The ratio estimators determined in [12] are most suitable for the positive degree of linear relationship, i.e., (ρ xy(j) > 0). Similarly, for the negative degree of a linear relationship, i.e., (ρ xy(j) < 0), the product versions of these estimators can be developed by taking the reciprocal of ratio type component. To defeat the constraint related to the indication of ρ xy(j) , [18] developed a usual regression-estimator under MRSS with their MSE expression as given by MSEð� y rgðjÞ Þ ffi where b xy(j) is the regression coefficient. [19] constructed a difference type estimator based on two tuning parameters, known as generalized regression estimator, under SRS design. By merging the idea of [12,18,19] also introduced a difference type estimators under MRSS as follows: where k 1(j) and k 2(j) are wisely chosen constants. The bias and MSE expressions are determined by

PLOS ONE
Three-fold utilization of supplementary information for mean estimation under MRSS

Materials and methods
The reasonable utilization of the supplementary information helps in expanding the exactness of an estimator both at the structuring stage as well as at the estimation stage, see [21][22][23][24][25][26][27][28][29][30][31][32][33]. In social, monetary, and regular studies, the total supplementary information is frequently accessible to the inspected outline. The basic concept is when there exists an adequate measure of the relationship between (X, Y); the rank and second raw moments are also connected (correlated) with the variate of interest. Consequently, the ranked supplementary variable V (which is based on the ranks of X) can be treated as a type of supplementary variable. Similarly, the second raw moment auxiliary variable Z (which is based on the second raw moment of X) can be treated as another supplementary variable. Hence, this three-fold supplementary information can help us in expanding the proficiency of the estimators. In view of these thoughts, we propose three new estimators of the finite population mean in the upcoming sub-sections of the current section.

First proposed estimator
Taking motivation from [12,18] and utilizing two out of three defined forms of auxiliary information (X, Z), we propose the following estimator as To obtain the bias and the MSE, we define If the sample size n is odd, we can write If the sample size n is even, we can write Expressing the proposed estimator � y P1 in terms of ε's, we have By applying the expectation to the above expression, we determine the theoretical expression of bias of degree n −1 as if n is even where Taking the square and the expectation of (18), we obtain the MSE expressions of � y P1 . After partial differentiation of the MSE, we get the optimum values of weights, i.e., k 1 , k 2 , and k 3 . By substituting the optimum values of weights, we determine the minimum MSE expressions.
The expressions of optimum weight and the minimum MSE are as follows. where if n is even where where

Second proposed estimator
Taking the motivation from � y P1 and utilizing two out of three defined forms of auxiliary information, i.e., (X, V), we propose the following estimator.
To obtain the bias and the MSE, we define If the sample size n is odd, we have If the sample size n is even, we write Accordingly, the bias and minimum MSE can be calculated from the expressions provided in previous section, by replacing the second raw moment auxiliary variable Z with the corresponding ranked supplementary variable V. Hence, we have Biasð� y P2 Þ ffi if n is odd if n is even where where Þ: For abbreviation, we intentionally skip the expressions of k 1 ; k 2 ; k 3 ; P 2 ðOÞ and P 2 ðEÞ . Interested readers may easily get the mathematical expressions of these quantities by simply replacing z with v, regarding the previous sub-section.

Third proposed estimator
Taking motivation from � y P1 and � y P2 and utilizing all the three defined forms of auxiliary information, i.e., (X, V, Z), we propose the following estimator where where The bias of � y P3 is

Biasð� y P3 Þ ffi
if n is odd m y ðk 1ðOÞ À 1Þ þ 3 8 l ðOÞ y 2 m y k 1ðOÞ ðm 2 x Þ À 1 s 2 Taking the square and the expectation of (30), we get the MSE expressions of � y P3 . Then, by partial differentiation of the MSE, we determine optimum values of weights, i.e., k 1 , k 2 , k 3 , and k 4 . By substituting the optimum values of weights, we obtain the minimum MSE expressions. The optimum weights and minimum MSE are as follows.
3 Results and discussion

Simulation study
While assessing the performance of the new recommended estimators, it is traditional to infer the MSE based conditions under which an estimator is more efficient than the others. However, these MSE based conditions are generally hard to confirm. Therefore, we have abstained intentionally from these conditions dependent on the MSE expressions and lean towards various numerical reenactment tests (simulation experiments), in the current Section. In the next Section, we will also assess the performance on behalf of the real life data set. We perform the simulation study for evaluating the features of new estimators, i.e., ð� y P1 ; � y P2 ; � y P3 Þ over existing ones, i.e., ð� y q1ðjÞ ; � y q3ðjÞ ; � y rgðjÞ ; � y N1ðjÞ ; � y N3ðjÞ ; ; � y Kk1ðjÞ ; � y Kk3ðjÞ Þ in MRSS. In this section, we conduct re-enactment tests to investigate the properties of the proposed estimators. In reenactment tests, we generate a population of size 10000 from a bivariate normal distribution Nðm x ¼ 2; m y ¼ 4; s 2 x ¼ 1; s 2 y ¼ 1; r xy Þ where, we assumed values of ρ xy = {±0.95, ±0.75, ±0.50, ±0.35, ±0.25, ±0.15}. For more details, interested readers may be referred to [18]. It is worth mentioning that only one form of the supplementary information, i.e., the supplementary variable X, generated from the bi-variate normal distribution. However, the remaining two forms of the supplementary information, i.e., (V, Z), are developed by taking ranks and square of X.
We consider two situations: First one is based on the true descriptives of population while the later one is based on the estimated descriptives of population.

Simulation with known population descriptives
In order to determine the percentage relative mean square error (PRMSE) of the concerned estimators, we consider both even and odd types of sample sizes, in light of the preliminaries defined in Section 2. We consider n = 3, 5 as odd and n = 4, 6 as even sample size. It is worth mentioning that each mentioned MRSS sample of size even/odd, selected 5000 times and the averaged MSE calculated. The PRMSE is calculated as: In the current sub-section, a simulation design is based on the assumption that complete supplementary information is available. Thus, the true population parameters of supplementary information are considered here.

Simulation with unknown population descriptives
The configuration of the simulation portrayed in the previous sub-section depends on the supposition that all descriptives of the population regarding both the supplementary and the subject variables, and shown in the equations of the estimators, are known, except for μ y . Without doubt, in spite of the fact that this circumstance has a hypothetical natural worth, its utility might be faulty in genuine examinations where some of the descriptives of supplementary variate are obscure. Thus, we have additionally investigated the adequacy of the proposed estimators under the reasonable circumstances where the previously mentioned population descriptives are obscure and should be assessed on the grounds that no dependable guess is accessible from past information, specialist advice or a pilot study. In such a condition, estimates of the objective parameters are influenced by an additional wellspring of variability; their conduct might be non-unique in relation to the situation where the population descriptives are thought to be given. To reveal insight into the matter, we address the issue of assessing the impact on the productivity of the reviewed and proposed estimators when all population descriptives, except for μx, are evaluated on the premise of various samples with different sizes. It is worth mentioning that the estimated optimum weights of the considered estimators are utilized for this particular situation.

Real life application
In this section, the estimators addressed in the previous section are assessed using real life data. Similar frame work of the previous section is adapted here. The data belongs to the production of corn and extension of agriculture land. Unit of production is "quintal" and

Interpretation of numerical results
The analysis of the designed based simulation and the real life application are given in Tables  1-5, respectively. From these Tables, we notice that all proposed estimators � y P1 À � y P3 , beat � y q1 , � y q3 , � y rg , � y Nk1 , � y Nk2 , � y Kk1 , � y Kk2 . Moreover, � y rg is always better than � y q1 À � y q3 . No consistent improvement is found between � y Kk1 À � y Kk2 . However, both of these are better compared to their reviewed counterparts � y q1 ; � y q3 ; � y rg ; � y Nk1 ; � y Nk2 . We see that the proposed estimators are more effective than the reviewed counterparts. The PRMSE values decrease as the correlation coefficient values increase. Also, for fixed values of the correlation coefficient and the same estimator, the MSE values decrease as the sample size increases for both true and estimated parameters. This examination reveals the superiority of the proposed estimators � y P1 À � y P3 over the reviewed estimators. Among all PRMSE estimators, computations are given in Tables 1-5, it can be noted that the � y P3 is more efficient in terms of the PRMSE. These are the results for the circumstances considered in the numerical study, conducted for the purposes of the paper. Hence, we feel sure that the equivalent could hold in other settings.

Conclusion
In the current study, we have proposed some new estimators for the mean estimation of a subject variable when supplementary information is accessible. We explore the three-fold utilization of a single supplementary variable under MRSS design. Our class sums up the usual regression estimator and difference estimator of [19], due to [18], under MRSS design. We have also calculated the asymptotic theoretical properties such as the bias and minimum MSE. These properties are helpful to pay attention to an underestimation of the population parameter of the subject and the variable, i.e., μ y . The proposed estimators are not exhaustive but rather can go about as an obstruction against the expansion of equivalent propositions that could show up later on. The proficiency of the proposed estimator has been explored by comparing it with different reviewed estimators, based on hypothetical and practical examples. For this, numerical examinations on some genuine and artificial populations have been conducted by Monte Carlo studies. These numerical examinations show the prevalence of the proposed estimators over the existing ones. Thus, the new proposals, based on three-fold utilization of a single supplementary variable, are recommended for survey practitioners. In future studies, the proposed three fold utilization can be extended for two stages MRSS in light of [12,18,34] work.